Jinbin Bai

Rethinking the Design Space of Reinforcement Learning for Diffusion Models: On the Importance of Likelihood Estimation Beyond Loss Design

Feb 04, 2026
Via arXiv

Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models

Feb 02, 2026
Via arXiv

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Dec 22, 2025
Via arXiv

RecTok: Reconstruction Distillation along Rectified Flow

Dec 17, 2025
Via arXiv

EditMGT: Unleashing Potentials of Masked Generative Transformers in Image Editing

Dec 12, 2025
Via arXiv

From Masks to Worlds: A Hitchhiker's Guide to World Models

Oct 23, 2025
Via arXiv

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

May 29, 2025
Via arXiv

Conditional Panoramic Image Generation via Masked Autoregressive Modeling

May 22, 2025
Via arXiv

An Empirical Study of GPT-4o Image Generation Capabilities

Apr 08, 2025
Via arXiv

Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer

Mar 21, 2025
Via arXiv